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Abstract 

We consider an infectious disease spreading along the edges of a network 
which may have significant clustering. The individuals in the population 
have heterogeneous infectiousness and/or susceptibility. We define the out- 
transmissibility of a node to be the marginal probability that it would infect 
a randomly chosen neighbor given its infectiousness and the distribution of 
susceptibility. For a given distribution of out-transmissibility, we find the 
conditions which give the upper [or lower] bounds on size and probability 
of an epidemic, under weak assumptions on the transmission properties, but 
very general assumptions on the network. We find similar bounds for a given 
distribution of in-transmissibility (the marginal probability of being infected 
by a neighbor). We also find conditions giving global upper bounds on size 
and probability. The distributions leading to these bounds are network- 
independent. In the special case of networks with high girth (locally tree- 
like), we are able to prove stronger results. In general the probability and size 
of epidemics are maximal when the population is homogeneous and minimal 
when the variance of in- or out-transmissibility is maximal. 
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1. Introduction 

The spread of infectious disease is governed by many different factors which vary on 
the individual level. Heterogeneity in the population comes from a number of sources 
including, but not limited to, genetic diversity, previous infections, vaccination history, 
or existence of co-infections. In this paper we investigate the effects of heterogeneity on 
disease spread, focusing on the effect of simultaneous heterogeneities in infectiousness 
and susceptibility. 

We consider the spread of infectious diseases in networks as shown in figure [T] 
Individuals in the population are modeled as nodes and potentially infectious contacts 
are modeled as edges between the corresponding nodes. We consider the spread of 
an SIR disease, that is the nodes are divided into three compartments: Susceptible, 
Infected, and Recovered. A susceptible node may be infected by an infected neighbor. 
Following infection, the newly infected node may infect some, all, or none of its 
neighbors and then recover. After recovery, a node cannot be reinfected. Typically 
in a large network outbreaks are either small or large (in a sense made more formal 
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Figure 1: The spread of disease in a network. The outbreak begins with a single infected 
individual [large empty circles] and then spreads along edges to others. The infected nodes 
recover with immunity [large filled circles]. Eventually the outbreak dies out. 

in section [2]) . We are primarily interested in what controls the probability of large 
outbreaks and the fraction of nodes infected in a large outbreak. 

Before discussing earlier results, we introduce some terminology. The transmissibil- 
ity T uv is the probability that an infection of node u would result in direct infection 
of the neighbor v. The in-transmissibility Ti n (v) is the marginal probability that a 
neighbor of v would infect v given the characteristics of v, and the out-transmissibility 
T ou t{u) is the marginal probability that u would infect a neighbor given the charac- 
teristics of u. Both the in- and out-transmissibility necessarily have the same average, 
(T) . These definitions will be made more precise in section [2] 

Most network-based epidemic models assume homogeneous transmissibility T uv — 
(T) between all pairs of neighboring nodes. Models that do allow heterogeneities 
generally show that they reduce the probability or size of epidemics [U [THl UHl I2H E2 ■ 
For an arbitrary network with homogeneous susceptibility [Ti n (v) = (T) for all v], but 
heterogeneous infectiousness, |16j showed that epidemics are most likely and largest 
if infectiousness is homogeneous [T out (u) = (T) for all u]. It was noted by [29] that 
the same argument shows that with homogeneous susceptibility epidemics are least 
likely and smallest if infectiousness is maximally heterogeneous (T out = for a fraction 
1 — (T) of the population and T out — 1 for the remainder) . The recent work of [HJ Q2] 
considered the effect of heterogeneity on a specific subclass of unclustered networks 
(variously called Molloy-Reed networks [23j or Configuration Model networks [26 ), 
finding similar results. One of these, [21], studied simultaneous heterogeneities in 
susceptibility and infectiousness, showing that for given (T), the same cases give the 
upper and lower bounds on probability while epidemics are largest if susceptibility is 
homogeneous and smallest if susceptibility is maximally heterogeneous. We are un- 
aware of any work which has considered simultaneous heterogeneities in infectiousness 
and susceptibility in networks with clustering or even in unclustered networks more 
general than Molloy-Reed networks. 

In this paper we investigate the spread of epidemics in which Ti n and T out can 
simultaneously be heterogeneous, using techniques from |16j and |21j . We will consider 
both clustered and general unclustered networks. Clustered networks are more difficult 
because of the existence of short cycles, and so a stronger assumption will be made for 
them. 

Often only the distribution of T out (or, more rarely, of T,- n ) would be available early 
in an outbreak. If we know the distribution of T outl it does not in general uniquely 
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determine the distribution of T in and so we cannot fully predict the final details of 
an outbreak. Our focus is on identifying the best and worst case scenarios given the 
distribution of T out (or Ti„), thus helping to provide policy makers with knowledge of 
what to expect and how best to mitigate it. 

Mathematical theories modeling the spread of infectious diseases have been de- 
veloped in a number of fields [TH [H |2j E] . The techniques used include differential 
equations, stochastic models, agent-based simulations, and network-based approaches. 
The differential equations approaches may be thought of as a mean-field approximation 
to a subclass of network models, while the stochastic and agent-based approaches can be 
made formally equivalent to network-based methods [T3J [23] . Consequently results for 
networks will apply to other models as well. Network epidemic models have primarily 
been studied by the statistics community [30j [TBI El HI 13 E2 [3] and the statistical 
physics/applied mathematics communities HOI CHI I2H IE HH HOI HI] • In general 
the statistics community has produced more rigorous results, but has considered more 
restricted classes of networks. The physics and applied mathematics communities have 
considered a wider range of networks, but the results are less rigorous. The interaction 
between these fields has been relatively sparse, leading to repeated discoveries of some 
results and a lack of cohesion in the topics studied. We attempt to bring some of these 
different approaches together in this paper. 

This paper is structured as follows: in section [2] we introduce the model and clarify 
definitions. In section [3] we consider epidemics spreading on general networks. In 
section|4]we find stronger results for networks with no short cycles. Finally in section|5] 
we discuss extensions and implications of our results. 

2. Epidemics in networks 

We consider the spread of disease on a network G. An outbreak begins when a single 
node (the index case) chosen uniformly from the population is infected. The disease 
spreads from an infected node u to a neighboring susceptible node v with a probability 
equal to the transmissibility T uv . Each infected node attempts to infect each of its 
neighbors and then recovers (and is no longer susceptible or infected). The outbreak 
ends when no infected nodes remain. 

This section, like Gaul, is divided into three parts. First we describe the neighbor- 
to-neighbor transmissibility T uv . This will depend on the characteristics of both u and 
v. We then introduce the concept of an Epidemic Percolation Network, which is a tool 
to study the routes of transmission in a given network. We finally discuss tools which 
will be used to make the concept of a "large network" rigorous. 

2.1. Transmissibility 

Following [21 , we assume that the factors influencing infectiousness of node u and 
susceptibility of node v may be summarized in I u and S v . In general, these may be 
vector-valued functions (though with few exceptions they are taken to be scalars in 
the literature). For example, T u may represent it's viral load, duration of infection, 
and willingness or ability to leave work if sick, while S v may represent v's previous 
vaccination history, genetic predisposition to infection, and previous exposure to related 
infections. If u and v are neighbors, the transmissibility T uv is then 
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for some function T. The function T(I, S) is the probability of transmission from a 
node with infectiousness I to a node with susceptibility S assuming that the nodes are 
joined by an edge. We may think of T uv as defined only for neighboring nodes, or we 
may take T uv = X{u,v}T(X u , S v ) where X{u,v} = if {u, v} is not an edge and 1 if it is. 

We assume that X and S are assigned independently, using the probability density 
functions P(X) and P(S) (although we use the same symbol P for both, we assume 
that the two functions are different). Particularly if X and S are vectors, we may not 
be able to clearly define which of two nodes is "more infectious" (i.e., there may not be 
a well-defined ordering). For example, with a sexually transmitted disease, we might 
have u\ and u-i infected, with u± having a high viral load and regular condom use (with 
occasional lapses), while ui has a low viral load but no condom use. Let us assume 
they have contacts with susceptibles v\ and V2 where v± has a high level of resistance, 
and thus will only be infected by a large dose, while «2 has no immune protection and 
thus will be infected by even a small dose. Under these assumptions, u± is more likely 
to infect v\, while ui is more likely to infect Vi. Which is "more infectious" depends 
on the test susceptible considered. 

The probability that u infects a neighbor (prior to knowing S for that neighbor) is 
given by the out-transmissibility of u 



and the probability that v would be infected by a neighbor is given by the in-transmissibility 



At times it will be convenient to use T out (X) and Tj n (<S) [rather than T out (u) and 
Tin{v)] to denote the out- and in-transmissibility of arbitrary nodes with X and iS 
respectively. When the concepts of being "more infectious" and "more susceptible" 
are clearly defined, T in (S) and T out (X) are invertible functions. However, because the 
ordering is not well-defined in general, they may not be invertible. If they are invertible, 
it is often convenient to change variables and set X u = T out (u) or S v = Ti n (v). We 
will do this frequently in section [3j where we restrict our attention to cases where the 
ordering described above is well-defined. 

From P(S) and P(X), we may find the distributions of Ti„ and T out . We use 
Qi n (T in ) to denote the probability density function for the in-transmissibility T in and 
Q out (Tout) to denote the probability density function for the out-transmissibility T out . 
The averages / T in Q in (T in )dT ln and /„ T out Q out (T out )dT out are both equal to (T). 

Given distributions of X and S and the function T, there is always a Qj n and Q ou t 
pair that result. Also, given a Q in or a Q out it is always possible to find P(X), P(S), and 
T that are consistent. For example, given any Qi n , for each node v we assign a T n (v) 
from Qi n and set S v — Ti n (v). Then T(X,S) = S is consistent with Qi n and yields 
Qout(T ou t) = &(T ut — (T))- This means that for any distribution of in-transmissibility, 
it is possible that the infectiousness of nodes is homogeneous. Although it is possible 
to find a Qi n for any Q ou t (and vice versa) not all pairs Q ou t and Qi n are compatible. 
For example, if Qi n = (1 — (T))d(T in ) + (T) 5(T in — 1) (i.e., susceptibility is maximally 
heterogeneous) then the out-transmissibility must be homogeneously distributed; no 
other distribution is possible. This particular example will be important in Section [4] 
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Although in principle X and S may be vector-valued, they frequently are assumed to 
be scalars with the transmissibility between two neighbors given by (for example [71121] ) 

T uv = T(l u , S v ) = 1 - exp(-aX a S v ) (1) 

A number of disease models yield this form. For example: let a be the rate at which 
virus from an infected person reaches a susceptible person. Let X u be the infectious 
period of u. Let S v be the probability that a virus reaching v causes infection. Then 
the probability p that v has not become infected satisfies p = —aS v p. Integrating this 
over the infectious period X u of u yields equation (JT|) . 

We need one final concept related to the transmissibility. Let a node u be given, 
and let V = {v±, . . . , v m } be a subset of the neighbors of u. Assume we know S — 
(S Vl , . . . , S Vm ), but not X u . Define 

&n(V, $)= I II t 1 T ( X > S v )]P{X)alX . (2) 
J vev 

This is the probability that u will not infect any node in V given knowledge of S for 
each v S V, but marginalized over the possible values of X for u. We may similarly 
define 



iP in (V) = I 4> in (V,S)P(S)dS , (3) 
\l-T out )^Q out (T out )dT out . (4) 



This is the probability that u will not infect any v G V marginalized over S of v G V 
and the values of I for u. If \V\ = 1, then ipi n (V) = 1 — (T), which will be important 
later when we consider unclustered networks. 

2.2. Epidemic Percolation Networks 

Given a network G, the distributions P(X) and P(S), and the function T(X,S), we 
assign X and 5 to each node of G. We then create a new directed network S which is 
an Epidemic Percolation Network (EPN) [12 as follows: the nodes of £ are the nodes 
of G. For each edge {u, v} of G, we place directed edges (u,v) and (v,u) into £ with 
probability T uv and T vu respectively. The original network G gives the paths a disease 
could follow, while a realization of £ gives the paths the disease will follow (if given 
the chance) for a simulation. 

The out-component of a given node u found by assigning X and S and generating 
an EPN comes from the same distribution as the nodes infected by the dynamic 
epidemic process described earlier with u as the index case. The processes are formally 
equivalent. 

To motivate some definitions, we assume sufficiently high transmissibility that there 
are nodes in £ with giant in- or out-components [B]. We define H out to be those nodes 
with a giant in-component, and Hi n to be those nodes with a giant out-component 
in £. We define H scc = Hi n n H out . H scc will almost surely be a strongly connected 
component. H in is the in-component of H scc and H out is its out-component. In general, 
infection of any u € Hi n results in infection of all nodes in H out and occasionally a few 
other nodes (if u ^ H scc ). We define such an outbreak to be an epidemic. If u ^ -ffm, 
then a small self-limiting outbreak occurs. 
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For large values of N = \G\, the probability of an epidemic is given by y = 
E[|Zfj n |]/iV and the expected fraction infected in an epidemic is given by A = E[\H out \]/N+ 
O (log N/N). As ./V grows, \H out \/N approaches E[|if out |]/./V, and so the size of a single 
epidemic in a large population closely approximates expected size of epidemics (note 
that if we include non-epidemic outbreaks in the average, this does not hold). 

If the directions of arrows in the EPNs are reversed, then Hi n and H out interchange 
roles. Consequently, replacing T uv = T{X Ul S v ) with T uv = T(1 V ,S U ) interchanges the 
size and probability. As such, results derived for the probability of an epidemic also 
apply to the size. 

2.3. Large Networks 

The results we derive will be appropriate in the limit of "large networks" . However, 
in practice we are usually interested in a single given network. Unfortunately \G\ — > oo 
is a vague concept when we are given a single, finite network. There are many ways to 
increase its size, with different impacts on epidemics. In this section, we define what 
is meant by \G\ — > oo in a way that allows us to produce rigorous results. 

Consider a sequence of networks {G n } which satisfy \G n \ — > oo as n — > oo. We 
define an open ball Bd(u) to be a network centered at a node u such that all nodes 
v E Bd(u) are at most a distance d from u. Given a network G, we define Pc(Bd(u)) to 
be the probability that if we choose a node u randomly from G, then the set of nodes 
of distance at most d from u is isomorphic to Bd(u) (with the isomorphism mapping 
ttto u). 

We define sequential convergence of local statistics to mean that given any d and 
Bd(u), Pc n (Bd(u)) = PG d (Bd(u)) for all n > d. For the results developed later, all that 
is strictly needed is that Pc n {Bd{u)) converges as n — > oo, but the stronger statement 
that for n > d they do not change makes the proofs simpler. This means that for 
large enough n, networks have the same "small-scale" structure, and the size of what 
is considered "small-scale" increases with \G\. We restrict our attention to sequences 
which have sequential convergence of local statistics. 

For a given EPN, we define H in (d) and H out (d) to be the set of nodes from which a 
path of length (at least) d begins or ends respectively. At large d, these will correspond 
to the H in and H out described earlier. We define J^(G) and Ad(G) to be the probability 
that a randomly chosen node from G is in H in (d) and H out (d) respectively. Sequential 
convergence means that yd(G n ) = yd(Gd) and Ad(G n ) = Ad(Gd) for n > d. We 
finally define 

y = iim y d (G d ) , 

a^oo 

A = lim Ad(G d ) . 

d — >oo 

y measures the probability of an epidemic and A measures the fraction infected. 

We will prove our results in the limit n — > oo by showing that H in (d) and H out (d) 
for a given G n are maximal or minimal under different conditions. This means that 
our results are generally true for arbitrary finite networks. The reason we use the large 
n limit is because for networks which are small it is unclear what constitutes a giant 
component in an EPN, or similarly, for a network with some unusual structure on a size 
comparable to the network size (for example a network made up of a few disconnected 
components), a giant component may not be uniquely defined. Using the large n limit 
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avoids these problems. We could avoid the need for a limit by instead assuming the 
existence of a giant strongly connected component in the EPN and showing that the 
same conditions maximize or minimize the probability a node is in the in- or out- 
component of this giant strongly connected component. 

3. Bounds in general networks 

We begin by considering the spread of infectious diseases on arbitrary networks. We 
begin with a simple lemma which we will need in this section and the next. 

Lemma 1. (Edge Reversal.) Given T uv = T(X U ,S V ), if we interchange the roles of 
infectiousness and susceptibility so that T uv — T(X V ,S U ) for all edges, then y and A 
interchange roles. 

Proof. If we replace T uv = T(1 U ,S V ) with T uv = T(X V ,S U ), then the new EPNs 
correspond to reversing the direction of edges in the original EPNs. Since reversing 
the direction of edges in an EPN interchanges Hi n (d) and H out (d), this interchanges 
y and A, and finishes the proof. □ 

We now make a simplifying assumption which we will need for networks with short 
cycles. 

Assumption 1. (Ordering Assumption.) If T(Xi,S\) > T(X2 : S\) for any S\, then 
T(X\,S) > T(X2,<S) for all S. Further, strict inequality occurs for a set of positive 
measure. Similarly if T(Xi,S{) > T(Z\,<S2) for any X\, then T(X,Si) > T(X, <S 2 ) for 
all X with strict inequality for a set of positive measure. 

The ordering assumption is a statement about the functional form of T(X,S). It 
places no restrictions on the network. The assumption holds for equation ([lj, but as 
noted earlier there are many scenarios where it fails. 

The ordering assumption implies that T out (X) and T in (S) are invertible mappings. 
It also allows us to assume that I is a scalar quantity ordered such that 

X U >X U , T(X U ,S) >T(X U ,,S) VS T out (u)>T out (u') 

and further X u > X u i T out {u) > T out {u'). We may make similar conclusions about 
S . There will be more than one way to represent X or S as scalars. It will frequently 
(but not always) be convenient to identify X with T out (X) and S with Ti n (S). 

Previous work by [TB] considered the spread of infectious diseases on networks for 
which the only heterogeneity came from variation in duration of infection. Hence all 
nodes have the same Tj„, and variation occurs only in T out . This model satisfies the 
ordering assumption. In this section we generalize the results of [TB] by allowing T in 
and T out to be heterogeneous simultaneously 

We will drop the ordering assumption in section [4] where we consider networks with 
no short cycles. Even in this section, many of the results hold without the ordering 
assumption, but the proofs are less clean. The assumption is only strictly needed for 
Theorems [l] [3] andg] 

We are now ready to show that increased heterogeneity generally decreases the size 
and probability of epidemics. We show that for a given Qi„ [resp. Qout], both y and 
A are maximal when T out [resp. Tj n ] is homogeneous. They are minimal when the 
variance of T out [resp. T in ] is maximal subject to the constraint of Qi n [resp. Qout]- 
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We can also derive conditions for a global upper bound on y and A. The upper 
bounds occur when T uv = (T) for all neighbors u and v. We hypothesize a lower 
bound, but cannot prove it in networks with short cycles. 

To make the notation cleaner in the following lemma, we identify S with Ti n and so 
we may use T(X, Tj n ) in place of T(X, S). 

Lemma 2. Assume a sequence of networks {G n } with sequential convergence of local 
statistics and a susceptibility distribution Qi n (Ti n ). Assume the ordering assumption 
holds and consider a distribution of infectiousness P\{X) with transmissibility given by 
Ti(X,Ti n ), that is consistent with Q in- Let 4>i n ,i(V,S) be as in equation pj. LetAi and 
yi be the corresponding attack rate and epidemic probability. Similarly choose another 
Pi^T), T2(2",Tj„) with corresponding A2, 3^2; and<p2(V,S). Assume that (pi n ,i(V,S) < 
4>in, 2(^7 S) for all V and S. Then A\ > Ai and y± > 3^2- 

Proof. Let d > be given. 

Take G n , n > d. We will show that a node in an EPN created from G n using the 
first distribution is more likely to be in H out (d) than a node in an EPN created using 
the second distribution. 

Choose any node u from G n . Partition the nodes of G n into disjoint sets {u}, U\, 
and U2 ■ To the nodes in U\ we assign X from Pi (X) and to the nodes in U2 we assign 
X from P2(X). We assign Ti n to all nodes from Qm(Tj„). We will consider the effects 
of adding u to U% versus adding it to Ui- 

Consider a partial EPN £ created by assigning edges (w,v) from all w =/= u, using 
T\(X Wl Ti(v)) if w G U\ and T2(X w ,Ti(v)) if w G Ui- Now consider an arbitrary node 
v! (which may be u) which is not already in Hi n (d), but which would join Hi n (d) if 
the appropriate edges were added from u. Let V be the set of neighbors v of u for 
which adding the edge (u, v) would allow a path from u' to u to be extended to a path 
of length d. 

We consider extensions of £ formed by placing u into U\ or Ui- The probability 
that vl would be in Hi n (d) in the extended EPN is equal to the probability that u has 
at least one edge to some node in V. This probability is at least as high if u E U\ as 
if u G U2 by our assumption (j>i n ,i{V,S) < 4>m,2(V,S). Consequently the probability 
of v! to be in Hi n (d) is maximal if u € U\. Induction on \U\\ shows that yd{G n ) is 
largest if all nodes are in U\ . 

We now show that u € Ui increases Ad compared with u S U2. We can prove that 
placing u in U\ versus U2 can only increase the probability of a node to be at the end 
of a length d path. The proof proceeds largely as above. Consider the same partial 
EPN £ defined above. Let vl be a node which is not in H out (d) but would be if an edge 
from u to any v G V (note that u ^ u'). The probability that u' will be in H out (d) is 
( t ) in,iiy i S) or (frin^iy, S) depending on whether u is assigned X from P\ or P2. Because 
4>in,i(V,S) < 4>in,2(V,S) it follows that Ad is largest if u G U±. Induction on \U\\ shows 
Ad(G n ) is maximal if all nodes are in U±. 

Taking d — > 00 , it follows then that y and A are maximal if all nodes are in Ui , and 
so the proof is finished. □ 

We begin by showing that for fixed distribution of in-transmissibility, the size and 
probability are largest when the out-transmissibility is homogeneous. 
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Theorem 1. Lei Qi n (T in ) be given. Assume that the ordering assumption holds and 
that {G n } satisfies sequential convergence of statistics. Set S v = T in (v). Then y and 
A are maximized when T(T, Ti n ) = Tj n . 

Proof. By the ordering assumption, we may take T to be scalar with X\ > Ti iff 
T ou t(Ti) > T out (T.2). This allows us to use Chebyshev's "other" inequality [15]: if h\ 
and hi are decreasing functions of x and p is a probability density function, 



hi{x)h2{x)p{x) dx > 



h 1 {x)p{x) dx 



h,2{x)p{x) dx 



By induction J[Y[hj(x))p(x) dx > Y[ J hj{x)p{x) dx for any number of decreasing 
functions hj. 

Applying this to the decreasing function hj{T) = 1 — T(T, Ti n (vj)) we have 



4>in{V,S) 



n kiV) 

LvEV 

> Y[l-T in (v) 



P(I)dl, 



with equality if T(I, Tj n ) = Tj n . Thus by Lemma[2] A and 3^ are maximal, completing 
the proof. □ 

We have proven the upper bounds given Qi n (Ti„) occur when T out is homogeneous. 
We now show the lower bounds occur when T out is maximally heterogeneous. Because 
of the ordering assumption, we may take S v = Ti n {v). 

Theorem 2. Let Qi n (Ti n ) be given, assume the ordering assumption holds, and as- 
sume that {G n } satisfies sequential convergence of statistics. Take T to be chosen 
uniformly from [0,1]. Setting 



T(l,T in ) 



T m < 1 

1 T m > I 



(5) 



minimizes y and A. 

Proof. Given equation ^5l, we have 4n n {V,S) = min^gyjl — T m {v)}. 

We need to prove that for any arbitrary transmission function T(I,Ti n ) satisfying 
the ordering assumption and consistent with T in , 4> in (V,S) < min^yjl — T in (v)}. To 
do this, let T be given, T„ assigned to v\, . . . , v n and assume v±, . . . , v n are ordered 
such that T in (vi) > T in (v 2 ) >■■■> T in (v n ). 

Then 



« n 

(V,S)= / H[l-f(l,T in (vj))]P(l)dZ, 

J 3 = 1 



< 



[l-T(l,T in ( Vl ))]P(T)dI, 



< 1 - T in (vi) . 

This shows that for any T, <fii n (V, S) is at most the value it takes for ([5]). Thus Lemmaji] 
shows that y and A are minimal, completing the proof. □ 
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We derived the results above with fixed Qi n . Lemma [T] shows that the equivalent 
results must hold for Q ou t- 

Theorem 3. Let Q ut{T out ) be given. Assume that the ordering assumption holds and 
that {G n } satisfies sequential convergence of statistics. SetX u — T out (u). 

' If 

T(T out ,S) 

— Tout 

Then y and A are maximized. 
• If S is chosen uniformly in [0, 1] and 



T(T out ,S) 



T out < S 

1 Tout > S . 



then y and A are minimized. 

Proof. This follows immediately from Lemma [T] with Theorems [T] and [2] □ 

We now give a global upper bound for both y and A. 

Theorem 4. Let (T) be given. Under the ordering assumption with sequential con- 
vergence of statistics for {G n }, the maximum of y and A occur when T uv = (T) for 
all neighboring nodes. 

Proof. Consider a P(T), P(S), and T(T,S) which yields a global maximum for 
either y or A. If Ti„ is not homogeneous, then we can find a new infection process 
which preserves the same Qout(T ou t) with homogeneous Ti„ which can only increase y 
or A. A repeated application preserving the new homogeneous in-transmissibility, but 
now making T out also homogeneous again can only increase y or A. T ou t and T in are 
then homogeneous. This completes the proof. □ 

We finish with a conjecture about global lower bounds. 

Conjecture 1. Under the ordering assumption with sequential convergence of statis- 
tics for {Gn}, the minimum of y occurs when Qout(T ou t) = (T) S(T ou t — 1) + (1 — 
(T))S(T out ). 

The minimum of A occurs when Qi n (Ti n ) — (T) 8{Ti n — 1) + (1 — (T))6(Ti n ). 

Note that if Qout(Tout) = (T) S(T out - 1) + (1 - (T))6(T out ), then Q m (T m ) = 
$(Tin — (T)) is homogeneous. 

3.1. Discussion 

The results of this section have focused on extending earlier results of Kuulasmaa [TB] 
who considered a population with homogeneous susceptibility and heterogeneities in 
infectiousness due entirely to variation in duration of infection. We have extended 
these results to cover a wide range of heterogeneities in infectiousness and susceptibil- 
ity (simultaneously), under the assumption that infectiousness and susceptibility are 
assigned independently. In order to extend the proof used by Kuulasmaa, we have 
been forced to make the ordering assumption, which effectively means that if we order 
people by how infectious they would be to one test susceptible individual, the order is 
the same as we would find for another test susceptible individual. We do not have any 
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counter-examples to these theorems in the case where the ordering assumption fails, 
and so it is not clear that it is needed. In section [4] we will see that similar results hold 
in unclustered networks without needing the ordering assumption. 

Our results show that in general increasing the heterogeneity of the population is 
useful for either decreasing the size or decreasing the probability that an epidemic 
occurs. Given Q in [resp. Q ou t], both y and A are maximized if T out [resp. T in ) is 
homogeneous and minimized if it is maximally heterogeneous. Similarly, given just 
(T), we find that the global maxima of y and A occur when T = (T). Perhaps 
surprisingly, the conditions leading to upper and lower bounds are independent of the 
network, though the size of the variation between these bounds is network-dependent. 

Although we can prove lower bounds given Qi n [or Qout), we cannot prove global 
lower bounds given (T). We hypothesize that the global lower bound for y occurs 
when Q ou t is maximally heterogeneous and the global lower bound for A occurs when 
Qi n is maximally heterogeneous. In the next section we will see that these are the 
lower bounds for an unclustered population. However, we have not found a rigorous 
proof for general networks. In the proof of the upper bound, we took a given Q ou t and 
found Qin that maximizes y and A. We then held that Qi n fixed and found Q ou t to 
maximize, arriving at the upper bound. However, applying a similar technique to the 
lower bound fails because given any Q ou t, if we find a minimizing Qi„, attempting to 
then minimize with Qi n fixed simply returns the original Qout- The difficulty results 
from the fact that increasing heterogeneity in T ou t restricts the amount of heterogeneity 
in and vice versa. 

4. Bounds in unclustered networks 

Most studies of infectious diseases spreading on networks have been made for net- 
works for which the effect of short cycles may be neglected |25] . These investiga- 
tions have generally used Molloy-Reed networks [23] (also known as the configuration 
model [53]). The theory we develop here applies to these networks, but also to more 
general networks which may have degree-degree correlations, or even longer range 
correlations. 

When we study networks with no short cycles, we are able to prove stronger results 
and abandon the ordering assumption. We find that y depends on the network and 
Qout (T ut) only, while A depends on the network and Qi n {Ti n ) only. We can prove 
global upper and (unlike in the general case) lower bounds on y and A. 

Assumption 2. (Unclustered Assumption.) Given a sequence of networks {G n }, we 
assume that G n has girth greater than In. 

This assumption means that Bd(u) chosen from any G n with n > d must be cycle 
free. In particular, there is no alternate path between a node and a neighbor. It was 
this complication that forced the use of the ordering assumption earlier, and since 
the complication no longer exists, we drop the ordering assumption. The unclustered 
assumption will also allow us to use ipi n (V) rather than 4>i n (V, S). Thus we only require 
the marginal probability of the set of nodes V not to be infected to satisfy an inequality, 
rather than the inequality be satisfied for every possible set of susceptibilities. We must 
bear in mind that knowing Ti n or T out no longer uniquely determines X or S . 

Lemma 3. Let the sequence {G n } satisfy the unclustered assumption with sequential 
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convergence of statistics. Take P\(I), P\(S) and Ti(T,S). Let i^i n i(V) be as in 
equation Q. Similarly take P 2 (X); P2{£)> an d T^iljS) with corresponding ipin,2(V)- 

ifTp in ,i(v) < ^„, 2 (V) then y 1 >y 2 . 

Proof. This proof is similar to that of Lemma [2j 

Let d > be given. Take G„, n > d. Choose a node u from G n and partition the 
nodes of G n into {u}, U\, and U^. To the nodes in U\ we assign I from P\(T) and to 
the nodes of U2 we assign I from Pi^X)- To each node w (including u), we assign two 
susceptibilities, S w 1 and S w 2 such that 5™ 1 comes from Pi{S) and S w 2 comes from 

We create a partial EPN £ as follows. For each v € Z7i, we assign edges (v,w) 
using Ti(Iu,»S Wi i), and for u e U2 we assign them using T 2 (T V , S Wt 2)- We do not 
yet assign edges from u (but edges may point to u). Consider any v! not in Hi n {d) 
which would join Hi„(d) if an edge was added from u to any v £ V . By assumption, 
i>in,i{V) < ipi n ,2(V) and so the probability is greatest if I u is chosen from P\{I). It 
follows that y>i > 3^2- This completes the proof. □ 

This proof can be modified to work on clustered networks without the ordering 
assumption, so Lemma [2] does not require the ordering assumption. However, the 
proof is more technical and provides little additional insight, particularly because the 
main results following from Lemma [2] do require the ordering assumption. 

Theorem 5. Let the sequence {G n } satisfy the unclustered assumption with sequential 
convergence of local statistics. Let Qi n (Ti n ) be fixed. Then A is fixed. 

Proof. We follow the technique used to prove A is larger for one distribution than 
the other in Lemma [2] However, in following that proof, the lack of clustering means 
|V| = 1. Since for any distribution ip(V) = 1 — (T) when \V\ = 1, all distributions 
must give the same A, and the proof is finished. □ 

Theorem 6. If the assumptions of Theorem [5] hold except that Q ou t is fixed rather 
than Qi n , then y is fixed. 

Proof. This follows immediately from Lemma [l] and Theorem [5] □ 

Theorem 7. Let Qi n be given. Assume that {G n } satisfies the unclustered assumption 
with sequential convergence of statistics, y is maximized when T(T,S) = T,- n (<S). 

Although this result is analogous to Theorem [TJ the proof is fundamentally altered 
because we no longer have the ordering assumption. 

Proof. We first note that if T(I,S) = T in (S), then T out = (T) for all nodes. 

Now consider an arbitrary function T(T, S) with P(I) and P(S) to satisfy Qi n (Ti n ). 
The function (1 — T out )^ v ^ in equation Q is convex, so by Jensen's inequality ipi n is 
minimized by T out = (T). Lemma [3] completes the proof. □ 

Theorem 8. Let Qi n be given. Assume {G n } satisfies the unclustered assumption 
with sequential convergence of statistics, y is minimized when T is chosen uniformly 
from [0, 1] and 

11 KT in {S). 
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Proof. Following the proof of Theorem [2] we may show that <p in is maximized 
(subject to Qi n ) exactly when these assumptions hold. Thus from equation ^ ifj in is 
also maximized when these assumptions hold. Lemma [3] completes the proof. □ 

Theorem 9. Let Q ou t be given. Assume {G n } satisfies the unclustered assumption 
with sequential convergence of statistics. 

• A is maximized when T(T,S) — T out (T). 

• A is minimized when S is chosen uniformly from [0, 1] and 



T(1,S) 



S>T out (l) 

1 S<T out (I) 



Proof. This follows from Theorems [7] and [8] with Lemma [T] □ 

Before proving our final result, we introduce a lemma. 

Lemma 4. Let f(x) be a convex function on [0, 1] and p{x) be a probability density 
function on [0, 1], with expected value pq. Then 

f(x)p(x)dx<(l-p )f(0)+p o f(l). 



Proof. The definition of convexity gives 

f{x) < (1 - x)/(0) + xf(l) . 

Thus 

J f{x) P {x)dx < A(l - a;)/(Q)p(aO + xf(l)p(x)]dx < (1 - p )f(0) + p f(l) . 

□ 

Theorem 10. Let {G n } be a sequence of networks satisfying the unclustered assump- 
tion with sequential convergence of statistics. Assume that (T) is given: 

• The global upper bound for both y and A occurs when T uv — (T) for all pairs 
of neighbors. 

• The global lower bound for y occurs when Qout(T out ) = (T) 5{T out — 1) + (1 — 
(T))S(T out ). 

• The global lower bound for A occurs when Qi n (Ti n ) = (T) S(Ti n — 1) + (1 — 
(T))S(T m ). 

Proof. The proof of the upper bound is identical to that of Theorem [4] 
We prove the lower bound for y. The lower bound for A follows from Lemma [T| 
We have ip m (V) = j(l - T out )\ v \Q out {T out )dT out . We now seek to find Q out which 
maximizes ipi n in order to apply Lemma [3j 

Since (1 — T out )' [V ' { is a convex function, we may apply Lemma [4] with Q ou t playing 
the role of p. The maximum occurs when Q ut(T out ) — (T) S(T out — l) + (l — (T))5(T out ) 
and so Lemma [3] finishes the proof. □ 
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Although the upper bound for both y and A occurs when T uv = (T) for all pairs, 
our earlier results show that for unclustered networks y depends only on Q ou t(T out ) 
and the network, and so as long as T out {u) = (T) for all nodes u, we achieve the upper 
bound on y (but not on A). Symmetrically, if Ti n (v) — (T) for all nodes v, we achieve 
the upper bound on A. 

Note that the lower bound for A requires that T out (u) = (T) for all u, and so the 
population is homogeneously infectious. It follows from Theorem [8] that y is then 
maximal. Similarly the lower bound for y requires that A be maximal. 

4.1. Discussion 

The results of this section generalize those of [5T] which considered the special case 
of Molloy Reed networks. These results prove that the same scenarios give upper 
and lower bounds in unclustered networks with a wide range of correlations including 
assortative or disassortative mixing (high degree nodes preferentially joining with high 
or low degree nodes respectively), or longer range correlations. Although we proved 
these under the assumption that no short cycles exist, the results remain useful in 
networks with either few short cycles, or for situations in which the transmissibility is 
low enough that the short cycles are only rarely followed. 

Because of the lack of short cycles, the ordering assumption is not needed. This 
means that our results apply to a much wider class of disease transmission mechanisms, 
but at the cost of restricting the network. Again we find that which conditions give 
the upper or lower bound is network-independent. The amount of variation there is 
between these bounds, however, is network-dependent. 

The main distinction from clustered networks is that y depends only on the network 
structure and Qout(T ou t)- That is, y is independent of Qm(T in ). Similarly A depends 
only on the structure and Qi n (Ti n ). We note that unless the effect of clustering is 
very large, the dependence of y on in-transmissibility and A on out-transmissibility 
will be weak. Curiously the global lower bound for A found in Theorem [10] requires 
that Q out (T ou t) = S(T ou t — (T)), and so the population is homogeneously susceptible. 
It follows from Theorem [8] that y is then maximal. Similarly the lower bound for y 
requires that A be maximal. This has important implications for policy design because 
strategies to reduce T tend to have a heterogeneous impact on either S or X. 

5. Conclusions 

We have extended earlier work on the effect of heterogeneity in infectiousness on 
the spread of epidemics through networks. Our extensions allow for heterogeneity 
in susceptibility as well. Many of the results are similar. In general we find that 
the size and probability of epidemics are reduced if the population is more heteroge- 
neous. Unfortunately, increasing heterogeneity in susceptibility restricts the level of 
heterogeneity possible in infectiousness. In the extreme case where susceptibility is 
maximally heterogeneous, infectiousness must be homogeneous. Perhaps surprisingly, 
we have found that the distributions leading to upper and lower bounds on y and A 
are network-independent. 

Early in an outbreak, it is likely that we may gain some information about Q 0U t(T ut)- 
For example, in the early stages of the SARS epidemic, it was known that a number 
of people were highly infectious, while the rest were only mildly infectious and so 
Qout(T ut) was highly heterogeneous. However, there was little information on Qi n . 
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Once given the distribution of Q ou t, the results here show which distributions of Q in 
give the largest or smallest y and A. Our results further suggest that the distribution 
of infectiousness found for SARS is consistent with a low epidemic probability. It is 
difficult to extrapolate from observations what the sizes would have been without the 
interventions put into place, but the fact that a number of isolated cases occurred 
throughout the world without sparking local epidemics suggest that the probability of 
an epidemic from each introduction was low, consistent with our predictions. 

Our results further suggest that in order to prevent an epidemic, it is best to take 
measures that will have a heterogeneous impact on infectiousness, but in order to affect 
the size of an epidemic, it is best to take measures that will have a heterogeneous impact 
on susceptibility. In terms of actual interventions, we compare two strategies aimed at 
controlling a disease which is initially spreading with homogeneous T: in the first we 
devote resources to vaccinating half of the population, while in the second we devote 
them to identifying and removing half of the infected population. Both strategies 
reduce (T) by a factor of 2. In the first, the susceptibility is highly heterogeneous, but 
the probability an infected node infects a randomly chosen neighbor has simply gone 
down by a factor of 2, and so it remains homogeneous. Assuming that the unclustered 
approximation is valid, this maximizes the impact on size, but the impact on probability 
is minimized. In contrast, the second strategy maximizes the impact on probability, 
but minimizes the impact on size. 

Acknowledgments 

This work was supported in part by the Division of Mathematical Modeling at the 
UBC CDC and by DOE at LANL under Contract DE-AC52-06NA25396 and the DOE 
Office of ASCR program in Applied Mathematical Sciences. 

References 

[1] Abbey, H. (1952). An examination of the Reed-Frost theory of epidemics. Human 
Biology 24, 201-233. 

[2] Anderson, R. M. and May, R. M. (1991). Infectious Diseases of Humans. 
Oxford University Press, Oxford. 

[3] Andersson, H. (1999). Epidemic models and social networks. Math. Scientist 
24, 128-147. 

[4] Ball, F. (1985). Deterministic and stochastic epidemics with several kinds of 
susceptibles. Advances in Applied Probability 17, 1-22. 

[5] Ball, F. and O'Neill, P. (1999). The distribution of general final state random 
variables for stochastic epidemic models. Journal of Applied Probability 36, 473 
491. 

[6] Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., 
Stata, R., Tomkins, A. and Wiener, J. (2000). Graph structure in the web. 
Computer Networks 33, 309-320. 



16 Joel C. Miller 

[7] Del Valle, S. Y., Hyman, J. M., Hethcote, H. W. and Eubank, S. G. 
(2007). Mixing patterns between age groups in social networks. Social Networks 
29, 539-554. 

[8] Eubank, S., Guclu, H., Kumar, V. S. A., Marathe, M. V., Srinivasan, 
A., Toroczkai, Z. and Wang, N. (2004). Modelling disease outbreaks in 
realistic urban social networks. Nature 429, 180-184. 

[9] Hastings, M. B. (2006). Systematic series expansions for processes on networks. 
Physical Review Letters 96, 148701. 

[10] Keeling, M. J. (2005). The implications of network structure for epidemic 
dynamics. Theoretical Population Biolology 67, 1-8. 

[11] Keeling, M. J. and Eames, K. T. D. (2005). Networks and epidemic models. 
Journal of the Royal Society Interface 2, 295-307. 

[12] Kenah, E. and Robins, J. M. (2007). Network-based analysis of stochastic SIR 
epidemic models with random and proportionate mixing. Journal of Theoretical 
Biology. 

[13] Kenah, E. and Robins, J. M. (2007). Second look at the spread of epidemics 
on networks. Physical Review E 76, 36113. 

[14] Kermack, W. O. and McKendrick, A. G. (1927). A contribution to the 
mathematical theory of epidemics. Royal Society of London Proceedings Series A 
115, 700-721. 

[15] Kingman, J. F. C. (1978). Uses of exchangeability. The Annals of Probability 
6, 183-197. 

[16] Kuulasmaa, K. (1982). The spatial general epidemic and locally dependent 
random graphs. Journal of Applied Probability 19, 745-758. 

[17] Kuulasmaa, K. and Zachary, S. (1984). On spatial general epidemics and 
bond percolation processes. Journal of Applied Probability 21, 911-914. 

[18] Madar, N., Kalisky, T., Cohen, R., ben Avraham, D. and Havlin, S. 
(2004). Immunization and epidemic dynamics in complex networks. The European 
Physical Journal B 38, 269-276. 

[19] Meyers, L. A. (2007). Contact network epidemiology: Bond percolation applied 
to infectious disease prediction and control. Bulletin of the American Mathematical 
Society 44, 63-86. 

[20] Meyers, L. A., Newman, M. and Pourbohloul, B. (2006). Predicting 
epidemics on directed contact networks. Journal of Theoretical Biology 240, 400- 
418. 

[21] Miller, J. C. (2007). Epidemic size and probability in populations with 
heterogeneous infectivity and susceptibility. Physical Review E 76, 010101. 



Epidemic Size and Probability 



17 



[22] Mollison, D. (1977). Spatial contact models for ecological and epidemic spread. 
Journal of the Royal Statistical Society. Series B (Methodological) 39, 283-326. 

[23] Molloy, M. and Reed, B. (1995). A critical point for random graphs with a 
given degree sequence. Random structures & algorithms 6, 161-179. 

[24] Neal, P. (2007). Copuling of two SIR epidemic models with variable susceptibil- 
ities and infectivities. Journal of Applied Probability 44, 41-57. 

[25] Newman, M. E. J. (2002). Spread of epidemic disease on networks. Physical 
Review E 66, 16128. 

[26] Newman, M. E. J. (2003). The structure and function of complex networks. 
SIAM Review 45, 167-256. 

[27] Pastor- Satorras, R. and Vespignani, A. (2001). Epidemic spreading in 
scale-free networks. Physical Review Letters 86, 3200-3203. 

[28] Serrano, M. and Boguna, M. (2006). Percolation and epidemic thresholds in 
clustered networks. Physical Review Letters 97, 088701. 

[29] Trapman, P. (2007). On analytical approaches to epidemics on networks. 
Theoretical Population Biology 71, 160-173. 

[30] van den Berg, J., Grimmett, G. R. and Schinazi, R. B. (1998). Dependent 
random graphs and spatial epidemics. The Annals of Applied Probability 8, 317- 
336. 



